The Annals of Statistics 

2011, Vol. 39, No. 4, 2131-2163 

DOI: 10.1214/11-AOS895 

© Institute of Mathematical Statistics. 2011 



GLOBAL SELF-WEIGHTED AND LOCAL QUASI-MAXIMUM 
EXPONENTIAL LIKELIHOOD ESTIMATORS FOR 
ARMA GARCH/IG ARCH MODELS 

By Ke Zhu and Shiqing Ling 1 

Hong Kong University of Science and Technology 

This paper investigates the asymptotic theory of the quasi-maxi- 
mum exponential likelihood estimators (QMELE) for ARMA-GARCH 
models. Under only a fractional moment condition, the strong con- 
sistency and the asymptotic normality of the global self-weighted 
QMELE are obtained. Based on this self-weighted QMELE, the lo- 
cal QMELE is showed to be asymptotically normal for the ARMA 
model with GARCH (finite variance) and IGARCH errors. A formal 
comparison of two estimators is given for some cases. A simulation 
study is carried out to assess the performance of these estimators, 
and a real example on the world crude oil price is given. 



1. Introduction. Assume that {yt :t = 0, ±1, ±2, . . .} is generated by the 
ARMA-GARCH model 



(1.1) y t = fj, + fayt-i + y~) ipjSt-j + £t, 

i=l i=l 

r s 

(1.2) e t = VtVh and h t = a + ^ + &^-*> 



i=\ t=l 



where «o > 0, oti > (i = 1, . . . ,r), f3j > (j = 1, . . . , s), and rj t is a sequence 
of i.i.d. random variables with Eijt = 0. As we all know, since Engle (1982) 
and Bollerslev (1986), model (1.1)— (1.2) has been widely used in economics 
and finance; see Bollerslev, Chou and Kroner (1992), Bera and Higgins 
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(1993), Bollerslev, Engel and Nelson (1994) and Francq and Zakoi'an (2010). 
The asymptotic theory of the quasi-maximum likelihood estimator (QMLE) 
was established by Ling and Li (1997) and by Francq and Zakoi'an (2004) 
when Ee\ < oo. Under the strict stationarity condition, the consistency 
and the asymptotic normality of the QMLE were obtained by Lee and 
Hansen (1994) and Lumsdaine (1996) for the GARCH(1,1) model, and by 
Berkes, Horvath and Kokoszka (2003) and Francq and Zakoi'an (2004) for the 
GARCH(r, s) model. Hall and Yao (2003) established the asymptotic theory 
of the QMLE for the GARCH model when Ee\ < oo, including both cases 
in which Erjf = oo and Ei]f < oo. Under the geometric ergodicity condition, 
Lang, Rahbek and Jensen (2011) gave the asymptotic properties of the modi- 
fied QMLE for the first order AR-ARCH model. Moreover, when E\£ t \ L <oo 
for some i > 0, the asymptotic theory of the global self- weighted QMLE and 
the local QMLE was established by Ling (2007) for model (1.1)-(1.2). 

It is well known that the asymptotic normality of the QMLE requires 
Erjf < oo and this property is lost when Erjf = oo; see Hall and Yao (2003). 
Usually, the least absolute deviation (LAD) approach can be used to reduce 
the moment condition of rjt and provide a robust estimator. The local LAD 
estimator was studied by Peng and Yao (2003) and Li and Li (2005) for the 
pure GARCH model, Chan and Peng (2005) for the double AR(1) model, 
and Li and Li (2008) for the ARFIMA-GARCH model. The global LAD 
estimator was studied by Horvath and Liese (2004) for the pure ARCH 
model and by Berkes and Horvath (2004) for the pure GARCH model, and 
by Zhu and Ling (2011a) for the double AR(p) model. Except for the AR 
models studied by Davis, Knight and Liu (1992) and Ling (2005) [see also 
Knight (1987, 1998)], the nondifferentiable and nonconvex objective function 
appears when one studies the LAD estimator for the ARMA model with 
i.i.d. errors. By assuming the existence of a -y/n-consistent estimator, the 
asymptotic normality of the LAD estimator is established for the ARMA 
model with i.i.d. errors by Davis and Dunsmuir (1997) for the finite variance 
case and by Pan, Wang and Yao (2007) for the infinite variance case; see 
also Wu and Davis (2010) for the noncausal or noninvertible ARMA model. 
Recently, Zhu and Ling (2011b) proved the asymptotic normality of the 
global LAD estimator for the finite/infinite variance ARMA model with 
i.i.d. errors. 

In this paper, we investigate the self-weighted quasi-maximum exponen- 
tial likelihood estimator (QMELE) for model (1.1)— (1.2). Under only a frac- 
tional moment condition of £t with Erfi < oo, the strong consistency and 
the asymptotic normality of the global self-weighted QMELE are obtained 
by using the bracketing method in Pollard (1985). Based on this global self- 
weighted QMELE, the local QMELE is showed to be asymptotically normal 
for the ARM A-G ARCH (finite variance) and -IGARCH models. A formal 
comparison of two estimators is given for some cases. 
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Fig. 1. The Hill estimators {& v (k)} for {rjt 2 }- 



To motivate our estimation procedure, we revisit the GNP deflator exam- 
ple of Bollerslev (1986), in which the GARCH model was proposed for the 
first time. The model he specified is an AR(4)-GARCH(1, 1) model for the 
quarterly data from 1948.2 to 1983.4 with a total of 143 observations. We 
use this data set and his fitted model to obtain the residuals {fit}. The tail 
index of {rjf } is estimated by Hill's estimator &rj(k) with the largest k data 
of {fit}, that is, 



Ej=l( lo g^l43-j -logr/143-fc) 

where fjj is the jth order statistic of {ftf}. The plot of {a v (k)} 7 k ° =l is given 
in Figure 1. From this figure, we can see that a v (k) > 2 when k < 20, and 
a„(k) < 2 when k > 20. Note that Hill's estimator is not so reliable when k is 
too small. Thus, the tail of {/?|} is most likely less than 2, that is, Erjf = oo. 
Thus, the setup that rit has a finite forth moment may not be suitable, and 
hence the standard QMLE procedure may not be reliable in this case. The 
estimation procedure in this paper only requires Erfi < oo. It may provide 
a more reliable alternative to practitioners. To further illustrate this advan- 
tage, a simulation study is carried out to compare the performance of our 
estimators and the self- weighted/local QMLE in Ling (2007), and a new real 
example on the world crude oil price is given in this paper. 

This paper is organized as follows. Section 2 gives our results on the 
global self-weighted QMELE. Section 3 proposes a local QMELE estimator 
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and gives its limiting distribution. The simulation results are reported in 
Section 4. A real example is given in Section 5. The proofs of two techni- 
cal lemmas are provided in Section 6. Concluding remarks are offered in 
Section 7. The remaining proofs are given in the Appendix. 

2. Global self-weighted QMELE. Let 6 = {i,5')' be the unknown pa- 
rameter of model (1.1)— (1.2) and its true value be #o, where 7 = (//, (j>\, . . . , (f> p , 
ipi,..., ip q y and 5 = (ao, . . . , a r , f}\, . . . , f3 s )'. Given the observations {y n , . . . , 
y±} and the initial values Yq = {yo,y~i, ■ ■ • }, we can rewrite the parametric 
model (1.1)-(1.2) as 

p q 

(2-1) e t (7) = y t - n - ^ 4>iVt-i - ^ Vi £ t-i(7)) 

i=l i=l 



m(0) = et(l)/Vht(0) and 

(2.2) 

r s 

^ai£ i 2 _ i (7)+5Z' 

i=l i=l 



h t {9) = a + ya i E 2 t _ l { 1 ) + yl(3 i ht-i{e). 



Here, %(#o) = rjt, £4(70) = £t an d ht(6o) = ht- The parameter space is = 
G 7 x @ s , where 7 C RP+1+ 1 , e 5 C R r +S+1 , R = (-00,00) and R = [0,oo). 
Assume that 7 and ©,5 are compact and #0 is an interior point in 0. 
Denote a(z) = YJ i=x aiz\ f3(z) = 1 - Y%=iPi z \ <M 2 ) = 1 ~ ZS=i<^ and 
ip(z) = 1 + Yli=i ^iZ 1 - We introduce the following assumptions: 

Assumption 2.1. For each 8 e 9, <f>(z) / and ip(z) / when |z| < 1, 
and <j>(z) and tp(z) have no common root with (j> p 7^ or ij)q 7^ 0. 

Assumption 2.2. For each € 0, a(z) and /3(z) have no common root, 
a(l) / 1, a r + p s £ and £? =1 ft < 1. 

Assumption 2.3. rfi has a nondegenerate distribution with Erf < 00. 

Assumption 2.1 implies the stationarity, invertibility and identifiability 
of model (1.1), and Assumption 2.2 is the identifiability condition for mo- 
del (1.2). Assumption 2.3 is necessary to ensure that rf is not almost surely 
(a.s.) a constant. When r\ t follows the standard double exponential distri- 
bution, the weighted log-likelihood function (ignoring a constant) can be 
written as follows: 



1 n / 

(2.3) L sn {6) = - y Wt l t (9) and l t {6) = log ^/h^f) + 



t=i 



k*(7)l 



where wt = w(yt-i,yt-2, ■ ■ ■) an d to is a measurable, positive and bounded 
function on R z ° with Zq = {0, 1,2,. . .}. We look for the minimizer, 6 sn = 
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(I'sn^'snY, of L sn (6) on 9, that is, 



9 



argmin L sn (6). 
e 



Since the weight wt only depends on {yt} itself and we do not assume that r\t 
follows the standard double exponential distribution, 9 sn is called the self- 
weighted quasi-maximum exponential likelihood estimator (QMELE) of 9q. 
When ht is a constant, the self-weighted QMELE reduces to the weighted 
LAD estimator of the ARMA model in Pan, Wang and Yao (2007) and Zhu 
and Ling (2011b). 

The weight Wt is to reduce the moment condition of e% [see more discus- 
sions in Ling (2007)], and it satisfies the following assumption: 

Assumption 2.4. E[(w t + Wt)£%-i\ < 00 f° r an y P G (°; 1)> where £ pt = 



When wt = 1, the 9 sn is the global QMELE and it needs the moment 
condition E\et\^ <co for its asymptotic normality, which is weaker than the 
moment condition Ee\ < oo as for the QMLE of 9q in Francq and Zako'ian 
(2004). It is well known that the higher is the moment condition of £j, the 
smaller is the parameter space. Figure 2 gives the strict stationarity region 
and regions for E\et\ 2 '' < oo of the GARCH(1,1) model: et = i]t\' r ht and 
ht = do + a\e\_i + f3iht-i, where rj t ~ Laplace(0, 1). From Figure 2, we can 
see that the region for ^|et| 01 < oo is very close to the region for strict 
stationarity of et, and is much bigger than the region for Ee\ < oo. 

Under Assumption 2.4, we only need a fractional moment condition for 
the asymptotic property of 6 sn as follows: 

Assumption 2.5. E\et\ 2i < oo for some i > 0. 

The sufficient and necessary condition of Assumption 2.5 is given in The- 
orem 2.1 of Ling (2007). In practice, we can use Hill's estimator to estimate 
the tail index of {yt} and its estimator may provide some useful guidelines 
for the choice of l. For instance, the quantity 2l can be any value less than 
the tail index {yt}- However, so far we do not know how to choose the op- 
timal l. As in Ling (2007) and Pan, Wang and Yao (2007), we choose the 
weight function wt according to i. When i = 1/2 (i.e., E\et\ < oo), we can 
choose the weight function as 



\ k k=l ) / 

where C > is a constant. In practice, it works well when we select C as 
the 90% quantile of data {yi, . ■ ■ ,y n }- When q = s = (AR-ARCH model), 



1 + T l ZoP l \vt-i 



(2.4) 




aT 0.5- 
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Parameter region of GARCH(1,1) model 




Fig. 2. The regions bounded by the indicated curves are for the strict stationarity and 
for E\et\ 2 ' 1 < co with i = 0.05, 0.5, 1, 1.5 and 2, respectively. 



for any i > 0, the weight can be selected as 

/ f _ P+r 1 
w t = max< 1,(7 1 ^ \y t ~k\I {\yt-k\ > C} 



k=l 



When l S (0, 1/2) and q > or s > 0, the weight function need to be modified 
as follows: 



wt = max< 



fc=i 



Obviously, these weight functions satisfy Assumptions 2.4 and 2.7. For more 
choices of wt, we refer to Ling (2005) and Pan, Wang and Yao (2007). We 
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first state the strong convergence of 9 sn in the following theorem and its 
proof is given in the Appendix. 

Theorem 2.1. Suppose rjt has a median zero with E\rjt\ = 1. If Assump- 
tions 2.1-2.5 hold, then 

Gsn^-Go o,.s., as n — > oo. 

To study the rate of convergence of 9 sn , we reparameterize the weighted 
log-likelihood function (2.3) as follows: 

L n (u) = nL sn (9 + u)- nL sn (9 ), 

where u G A = {u = (n^, u' 2 )' : u + 6q £ 0}. Let u n = 9 sn — 9q. Then, u n is the 
minimizer of L n (u) on A. Furthermore, we have 

n n n 

(2.5) L n (u) = w t A t (u) + w t B t {u) + ^ w t C t (u), 



t=i t=i t=i 



where 



At ^ = 7r75^ 0gt(7o+m)l - k*(7o)|], 

\/ht{e Q ) 



B t{U ) = iogy^T^-io g y^) + ' £ ; ( ; o)l - J^L, 



C t (u) 



[|e t (7o + «i)| - k*(7o)|]- 



.y/ht(e + u) y/hd9o) 
Let /(•) be the indicator function. Using the identity 
\x-y\- \x\ = -y[I(x > 0) - I(x < 0)] 



(2.6) 



+ 2 / [I(x < s) - I(x < 0)] ds 
Jo 

for x ^0, we can show that 

r-qt{u) 

(2.7) = g t (u)[/(^ > 0) - /(r? t < 0)] + 2 / X t {s)ds, 



where X f (s) = I(n t < s) - I(n t < 0), q t (u) = qu{u) + q 2 t(u) with 
^{u) = , = -- — ^— and q 2t (u) ~ 



s/Uh) 89 ^ ' 2^%) 8969' 

and £* lies between 70 and 70 + u\. Moreover, let JFt = o-{r]k '■ k < t} and 

it{u) = 2w t / X t {s)ds. 
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Then, from (2.7), we have 
n 

(2.8) Yl w tMu) = u'Tin + ni„(«) + n 2n (n) + n 3 „(u) 



t=i 



where 



n ln (u) = £{&(u) - sfoMi^u]}, 

i=l 

n 2 „(«) = ^^[6(«)|^ t -i], 



n 3n (u) = ^w t (ftt(u)[J(^ > 0) - /fa < 0)] 

-Qt(u) 



t=l 



t=l 



9lt(«) 



X t (s)ds. 



By Taylor's expansion, we can see that 



(2.9) 
where 



y~)w t Bt(u) = u'T 2n + n 4n (-u) + n 5n (n), 



^2n 



Ew t dh t (9 ) 
2h7^SS-0--\rH\ 



t=i 



n 4 n(«) = u' ^ W t 



i=l 



06 



£t(7o) 



1\ 1 d/i t (C*) 3fct(C* 



n 5 n(n) = u'J2w t ( J- J 

t=i 



e*(7o) 



4 J h 2 t {C) 36 



1 5 2 /i t (C*) 



89' 



-u, 



and C* lies between #o and Oq + u. 

We further need one assumption and three lemmas. The first lemma is 
directly from the central limit theorem for a martingale difference sequence. 
The second- and third- lemmas give the expansions of IIj n (u) for % = 1, . . . , 5 
and ^]t = iCt{u). The key technical argument is for the second lemma for 
which we use the bracketing method in Pollard (1985). 
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Assumption 2.6. r\t has zero median with E\rjt\ = 1 and a continuous 
density function g(x) satisfying g(0) > and sup xeR g(x) < oo. 

Lemma 2.1. LetT n = Ti n + T2 n . If Assumptions 2.1-2.6 hold, then 
-.T n — >d N(0, S7q) asn— >oo, 



'n 

where — >d denotes the convergence in distribution and 
o - F ( w " de t ( l0 )de t ( l0 ) \ E V ?-1 ( w\ 8h t (6 ) 8h t (6 ) 
\h t {6 ) 86 86' J 4 U?(#o) 96 86' 

Lemma 2.2. If Assumptions 2.1-2.6 hold, then for any sequence of ran- 
dom variables u n such that u n = o p (l), it follows that 

ni„(n n ) = o p (y/n\\u n \\ + n||u n || 2 ), 

where o p (-) — > in probability as n — > oo. 

Lemma 2.3. If Assumptions 2.1-2.6 hold, then for any sequence of ran- 
dom variables u n such that u n = o p (l), it follows that: 



(i) 


n 2n ("n) 


(h) 


n 3n (n n ) 


iii) 


n 4 n(«n) 


» 


n 5n (n n ) 






(v) 





t=l 



where 



and 



£i = <?(0)£ 



h t {6 ) 86 86' 



I f w t 8h t {6 ) 8h t \v 



8 \h 2 t {6 ) 86 86' 

The proofs of Lemmas 2.2 and 2.3 are given in Section 6. We now can 
state one main result as follows: 

Theorem 2.2. If Assumptions 2.1-2.6 hold, then: 

(i) y/H0 m -e o ) = O p (l), 

(ii) y/n(& sn - 6q) -> d N(0, 1 n T,Q 1 ) asn->oo, 
where £q = Si + £2- 
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Proof, (i) First, we have u n = o p (l) by Theorem 2.1. Furthermore, 
by (2.5), (2.8) and (2.9) and Lemmas 2.2 and 2.3, we have 

(2.10) L n (u n ) = u n T n + (^u n )'E (^u n ) + o p (^/n\\u n \\ + n||u n || 2 ). 

Let A m j n > be the minimum eigenvalue of So- Then 



Ln{u n ) > -\\\fnu n \ 



1 



n 



+ Op(l) 



+ n||u n || 2 [A min + Op(l)]. 



Note that L n (u n ) < 0. By the previous inequality, it follows that 

1 , 



(2.11) Vn\\un\\ < [A min + o p (l)]- 1 



n 



+ Op(l) 



P (l), 



where the last step holds by Lemma 2.1. Thus, (i) holds, 
(ii) Let u* = — Sq 1 T n /2n. Then, by Lemma 2.1, we have 



oo. 



Hence, it is sufficient to show that ^fnu n — ^fnu* n = o p (l). By (2.10) and (2.11) 
we have 

L n (un) = (y/nUn)'—7=T n + (^u n )'T l0 (^u n ) + o p (l) 



n 



= (v / nn ? - t ) / S (v / ^^n) - 2(V™ n )'£o(\/nii* ) + Op(l). 
Note that (2.10) still holds when u n is replaced by it*. Thus, 



1 



L n(K) = (y/nu* n )'—T n + (v^O'SoCV^O + o p (l) 



= -(V»<)'So(v^<) + o p (l). 
By the previous two equations, it follows that 



Ln(U r . 



(2.12) 



£n(«n) = - S (\/n?i n - \/nu* n ) + o p (l) 



> A r 



\\fnu n — \/nu* n \\ +o p (l). 

Since L n (u n ) - L n «) = n[L sn (6» + u n ) - L sn (9 + <)] < a.s., by (2.12), 
we have ||v / n , u n — \fnu* n || = o p (l). This completes the proof. □ 

Remark 2.1. When Wt = 1, the limiting distribution in Theorem 2.2 is 
the same as that in Li and Li (2008). When r = s = (ARMA model), it 
reduces to the case in Pan, Wang and Yao (2007) and Zhu and Ling (2011b). 
In general, it is not easy to compare the asymptotic efficiency of the self- 
weighted QMELE and the self- weight QMLE in Ling (2007). However, for 
the pure ARCH model, a formal comparison of these two estimators is given 
in Section 3. For the general ARMA-GARCH model, a comparison based 
on simulation is given in Section 4. 
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In practice, the initial values Yq are unknown, and have to be replaced by 
some constants. Let it{9), h t (0) and wt be £t(9), ht(9) and wt, respectively, 
when Yq are constants not depending on parameters. Usually, Yq are taken 
to be zeros. The objective function (2.3) is modified as 



L sn {9) = -Y j w t l t {9) and ~l t {0) = log <JUp) + 



fc(7)l 

t=i sj~ht(0) 
To make the initial values Yq ignorable, we need the following assumption. 

Assumption 2.7. E\w t — wt^ ^ = 0(t~ 2 ), where to = min{t, 1}. 

Let 9 sn be the minimizer of L sn (9), that is, 

9 sn = argmin L sn (6>). 
e 

Theorem 2.3 below shows that 9 sn and 9 sn have the same limiting property. 
Its proof is straightforward and can be found in Zhu (2011). 

Theorem 2.3. Suppose that Assumption 2.7 holds. Then, as n— >oo, 

(i) if the assumptions of Theorem 2.1 hold 

9su^9q a.s., 

(ii) if the assumptions of Theorem 2.2 hold 

y/n(9 m - 6q) ^ d N(0, l^ 1 ^^ 1 ). 

3. Local QMELE. The self-weighted QMELE in Section 2 reduces the 
moment condition of et, but it may not be efficient. In this section, we 
propose a local QMELE based on the self-weighted QMELE and derive its 
asymptotic property. For some special formal comparison of the 

local QMELE and the self-weighted QMELE is given. 

Using 9 sn in Theorem 2.2 as an initial estimator of 0q, we obtain the local 
QMELE 9 n through the following one-step iteration: 

(3.1) 9 n = 9 sn - [2S* (9 an )] l T*(9 sn ), 

where 

vvm-V\f 9 ^ d g *(7) det{j) 1 dh t {9) dht(9) 



t=i 



h t (9) 89 89' 8h 2 t (9) 89 89' 



1 dh t (9)^ 
+ 2h t (9) 89 V -I^>l> 
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In order to get the asymptotic normality of 8 n , we need one more assumption 
as follows: 



Assumption 3.1. Erft £i=i a 0i + ELi Ak < 1 or 

r s 

E vt ^2 a oi+^2 fa = 1 



i=l i=l 

with r\t having a positive density on R such that E\r]t\ T < oo for all r < tq 
and E\i] t \ T ° = oo for some tq G (0,oo]. 

Under Assumption 3.1, there exists a unique strictly stationary causal 
solution to GARCH model (1.2); see Bougerol and Picard (1992) and Basrak, 
Davis and Mikosch (2002). The condition Erft Yll=i a 0i + J2i=i@0i < 1 is 
necessary and sufficient for Eel < 00 under which model (1.2) has a finite 
variance. When Erft £[ =1 a 0i + Y%=1 Am = h model ( L2 ) is called IGARCH 
model. The IGARCH model has an infinite variance, but -E|et| 2t < oo for all 
i £ (0,1) under Assumption 3.1; see Ling (2007). Assumption 3.1 is crucial 
for the ARMA-IGARCH model. From Figure 2 in Section 2, we can see that 
the parameter region specified in Assumption 3.1 is much bigger than that 
for -E|et| 3 < oo which is required for the asymptotic normality of the global 
QMELE. Now, we give one lemma as follows and its proof is straightforward 
and can be found in Zhu (2011). 

Lemma 3.1. If Assumptions 2.1-2.3, 2.6 and 3.1 hold, then for any 
sequence of random variables 9 n such that \Jn(6 n — 9q) = O p {\), it follows 
that: 

(i) I[T*(^)-T*(0 o )] = {2X + o p (l)}(9 n -9 ) + oJ^= 
n \\ n 

(") \K{0n) = S + o p (l), 

(iii) -=T*(9 ) -t d N(0,Q) as n -too, 

where 

1 de t ( l0 ) 8e t ( l0 ) \ Erj-l J 1 dh t (9 ) 8h t (9 ) 



h t {9 ) 89 89' J 4 \h%(6 ) 99 89 
v (n\w( 1 ^(7o) de t ( l0 ) \ I f 1 8h t (9 ) 8h t (9 



h t (9 ) 89 89' J 8 \hf(0 o ) 89 89' 

Theorem 3.1. If the conditions in lemma 3.1 are satisfied, then 
Vn(9 n -9 )^dN(0,\Y l - 1 nY l - 1 ) asn^oo. 
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Proof. Note that y/n{9 sn - 9 ) = O p (l). By (3.1) and Lemma 3.1, we 
have that 







-1 




— 9 — 


-S*(^ sn ) 

n 




-Tn0sn) 
n 



= 9 sn - [2S + ^(i)]" 1 !"^^) + [22 + o p (i)\(e sn - 9 ) + o p 

It follows that 

M0n-eo) = ^~l T ^ 6o) +Op(l). 

By Lemma 3.1(iii), we can see that the conclusion holds. This completes the 
proof. □ 

Remark 3.1. In practice, by using 9 sn in Theorem 2.3 as an initial 
estimator of 9q, the local QMELE has to be modified as follows: 

9 n = 6sn — [2S* (9 sn )]~ l T*{6 sn ) , 

where and T*{9) are defined in the same way as £*(#) and T*(9), 

respectively, with Et{9) and ht{9) being replaced by £t{9) and ht{9). However, 
this does not affect the asymptotic property of 9 n ; see Theorem 4.3.2 in Zhu 
(2011). 

We now compare the asymptotic efficiency of the local QMELE and the 
self-weighted QMELE. First, we consider the pure ARMA model, that is, 
model (1.1)— (1.2) with ht being a constant. In this case, 

fl = E(w 2 t X lt X[ t ), S = g(0)E(w t X lt X' lt ), 

Q = E{X lt X' u ) and £ = g(0)Q, 

— 1/2 

where X±t = h t dety r fo)/d9. Let b and c be two any m-dimensional con- 
stant vectors. Then, 

c'Eobb'Eoc = {E[{c>^gl^)w t X u ){^g~(0)X' lt b)]} 2 

< Eic'v^wtXufEiyfgJojx'ub) 2 

= [c'g(0)n c][b"Eb] = c'[g(0)n b'Zb]c. 

Thus, <7(0)i7o& / S6 / — So66'So > (a positive semi-definite matrix) and hence 
ls o& = tr(fi~ 1/2 S 66 / Sofi " 1/2 ) < tr( 5 (0)fe'S6) = g(0)b'Eb. It follows 
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that E ^o^o 1 > [0(O)£] _1 = E^OE -1 . Thus, the local QMELE is more 
efficient than the self- weighted QMELE. Similarly, we can show that the 
local QMELE is more efficient than the self-weighted QMELE for the pure 
GARCH model. 

For the general model (1.1)— (1.2), it is not easy to compare the asymptotic 
efficiency of the self- weighted QMELE and the local QMELE. However, 
when rjt ~ Laplace(0, 1), we have 

s ° = E ( K Y XltX ' lt + Y X2tX zt) ' 

Q = E(w 2 t X lt X' lt + ^X 2t X' 2 )j , 

V = E{\X lt X' lt + \X 2t X' 2t ) and ft = 2E, 
where X 2t = h^ 1 dh t (9 )/d9. Then, it is easy to see that 
c'Eo&fe'Eoc 

= {E[^2-^ Wt X lt ){2-^X[ t b) + (c'2- 5 / 4 ™^)(2- 7/4 X^)]} 2 
<{^E(c'2-y^w t X lt fE{2-y^X' lt bf 

+ ^ E{c'2^l^w t X 2t YE(2~V^X' 2t bY} 2 

< [E{c'2- x ' A w t X lt ) 2 + E(c'2- 5 / 4 w t X 2t ) 2 } 

x [E{2-^X' lt bf + E(2-^X' 2t b) 2 } 
= [d2~ 1 / 2 n c}[b'2- x ' 2 T l b] = c'fi^Uob'Eblc. 

Thus, 2~ 1 n b'J:b' - E 6&'E > and hence &'E fto ^ob = tr(ft~ 1/2 E M/ x 

Eo^o V2 ) ^ tr(2~ 1 6'E6) = 2~ 1 6 / E6. It follows that Eq 1 n So 1 ^ 2Tl ^ = 
S -1 nS -1 . Thus, the local QMELE is more efficient than the global self- 
weighted QMELE. 

In the end, we compare the asymptotic efficiency of the self-weighted 
QMELE and the self-weighted QMLE in Ling (2007) for the pure ARCH 
model, when Erjf < oo. We reparametrize model (1.2) when s = as follows: 

r 

(3.2) Vt = vtVK and h* = a* + ^ajj^, 

i=l 

where r/ t * = rj t /y/Erfi, h* t = (Er] 2 )h t and 9* = (ag, a*, . . . ,«*)' = (Er] 2 )9. 
Let 9* n be the self-weighted QMLE of the true parameter, 9q, in model (3.2). 
Then, 9 sn = 9*JEt] 2 is the self- weighted QMLE of 9q, and its asymptotic 
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covariance is 

ri = Kl [E(w t X 2t X!, t )}- 1 E(w 2 t X 2t X! it )[E(w t X 2t X' 2t ))- 1 , 

where K\ = Er]f / (Erf) ~ !• By Theorem 2.2, the asymptotic variance of 
the self-weighted QMELE is 

T 2 = K 2 [E(w t X 2t X' 2i )]- 1 E(w 2 t X 2t X' 2t )[E{w t X 2t X' 2t )}-\ 

where k 2 = ^{Erj 2 — 1). When rjt ~ Laplace(0, 1), K\ = 5 and k 2 = 4. Thus, 
T\ >T 2 , meaning that the self- weighted QMELE is more efficient than the 
self-weighted QMLE. When rjt = fjt/E\fjt\, with rjt having the following mix- 
ing normal density: 



we have E\rjt\ = 1, 



and 



/(*) HI + 



Erjj 



1 2(l-e + er) 2 

3tt(1 -e + er 4 ) 
2(l-e + er) 2 (l-e + er 2 )' 



where </>(x) is the pdf of standard normal, < e < 1 and r > 0. The asymp- 
totic efficiencies of the self-weighted QMELE and the self-weighted QMLE 
depend on e and r. For example, when e = 1 and r = wtt/2, we have 
Ki = (6 — 7r) /V and «2 = 2ir — 4, and hence the self-weighted QMLE is more 
efficient than the self- weighted QMELE since T\ <T 2 . When e = 0.99 and 
r = 0.1, we have K\ = 28.1 and k 2 = 6.5, and hence the self-weighted QMELE 
is more efficient than the self- weighted QMLE since Ti >T 2 . 



4. Simulation. In this section, we compare the performance of the global 
self-weighted QMELE (§ sn ), the global self-weighted QMLE (6 sn ), the local 
QMELE (§ n ) and the local QMLE (0 n ). The following AR(1)-GARCH(1, 1) 
model is used to generate data samples: 

yt = V + (i>iyt-i + £t, 

(4-1) 

£t = Vtvh and h t = a + a\e 2 t _ x + fi\h t -\. 

We set the sample size n = 1,000 and use 1,000 replications, and study the 
cases when rjt has Laplace(0, 1), iV(0, 1) and distribution. For the case with 
Eej < oo (i.e., Eif t a m + A)i < 1), we take 9 = (0.0,0.5,0.1,0.18,0.4). For 
the IGARCH case (i.e., Erfta i + /3 i = 1), we take = (0.0, 0.5, 0.1, 0.3, 0.4) 
when rjt ~ Laplace(0, 1), = (0.0, 0.5, 0.1, 0.6, 0.4) when 7/t ~ iV(0, 1) and 
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#o = (0.0, 0.5, 0.1, 0.2, 0.4) when rjt ~ £3. We standardize the distribution of r\t 
to ensure that E\r]t\ = 1 for the QMELE. Tables 1-3 list the sample biases, 
the sample standard deviations (SD) and the asymptotic standard devia- 
tions (AD) of 

@sm@sm dn and n . We choose Wt as in (2.4) with C being 
90% quantile of {y±, . . . , y n } and yi = for i < 0. The ADs in Theorems 2.2 

and 3.1 are estimated by Xsn = V^^T^Xm^^ anc ^ Xn = l/^S" 1 ^!!" 1 , 
respectively, where 

(% n ) de t (j sn ) . w t 8h t (9 sn ) dh t (9 sn ) 





t=i 


f ff(0)itft 






r ™? 




E„ 




r 5(0) , 




t=i 


r 1 - 

Ut(<?n) 



a0 ay 8/if(^ n ) ^ w 

g£t(7sn) gft(7gn) £77? - 1 u| dh t {9 sn ) dh t (9 sn ) 
89 89' 4 /,2(4 n ) 89 89' 

9e t (7„) 5e t (7„) , 1 8h t (9 n ) 8h t (9 n ) 



89 89' 8h 2 t (9 n ) 89 89' 

8e t {%)8e t {%) Erjj-1 1 8h t (9 n ) 8h t (9 n ) 
89 89' + 4 h 2 t (9 n ) 89 89' 

From Table 1, when rjt ~ Laplace(0, 1), we can see that the self- weighted 
QMELE has smaller AD and SD than those of both the self-weighted QMLE 
and the local QMLE. When rjt ~ N(0, 1), in Table 2, we can see that the self- 
weighted QMLE has smaller AD and SD than those of both the self- weighted 
QMELE and the local QMELE. From Table 3, we note that the SD and AD 
of both the self-weighted QMLE and the local QMLE are not close to each 
other since their asymptotic variances are infinite, while the SD and AD 
of the self-weighted QMELE and the local QMELE are very close to each 
other. Except 9 n in Table 3, we can see that all four estimators in Tables 1-3 
have very small biases, and the local QMELE and local QMLE always have 
the smaller SD and AD than those of the self-weighted QMELE and self- 
weighted QMLE, respectively. This conclusion holds no matter with GARCH 
errors (finite variance) or IGARCH errors. This coincides with what we 
expected. Thus, if the tail index of the data is greater than 2 but Erjf = 00, 
we suggest to use the local QMELE in practice; see also Ling (2007) for 
a discussion. 

Overall, the simulation results show that the self-weighted QMELE and 
the local QMELE have a good performance in the finite sample, especially 
for the heavy-tailed innovations. 

5. A real example. In this section, we study the weekly world crude oil 
price (dollars per barrel) from January 3, 1997 to August 6, 2010, which 
has in total 710 observations; see Figure 3(a). Its 100 times log-return, de- 
noted by {yt}J£i, is plotted in Figure 3(b). The classic method based on the 
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Table 1 

Estimators for model (4- 1 ) when rjt ~ Laplace(0, 1) 



O = (0.0,0.5,0.1,0.18,0.4) 6 = (0.0,0.5,0.1,0.3,0.4) 



Self-weighted QMELE (0 sn ) Self-weighted QMELE (fl sn ) 





f-t-sn 


4>lsn 


&Osn 




/3lsn 


(J S 71 


4>lsn 






/3lsn 


Bias 


0.0004 


-0.0023 


0.0034 


0.0078 


-0.0154 


0.0003 


-0.0049 


0.0031 


0.0054 


-0.0068 


SD 


0.0172 


0.0317 


0.0274 


0.0548 


0.1125 


0.0195 


0.0318 


0.0219 


0.0640 


0.0673 


AD 


0.0166 


0.0304 


0.0255 


0.0540 


0.1061 


0.0192 


0.0311 


0.0218 


0.0624 


0.0664 






Local QMELE (0 n ) 






Local QMELE (0„) 






An 


4>ln 


C*On 




/3m 


An 


<t>ln 






0m 


Bias 


0.0008 


-0.0019 


0.0027 


0.0002 


-0.0094 


0.0010 


-0.0044 


0.0024 


-0.0008 


-0.0025 


SD 


0.0170 


0.0253 


0.0249 


0.0400 


0.0989 


0.0192 


0.0261 


0.0203 


0.0502 


0.0591 


AD 


0.0162 


0.0245 


0.0234 


0.0407 


0.0920 


0.0190 


0.0258 


0.0206 


0.0499 


0.0591 






Self-weighted QMLE (0 3n ) 






Self-weighted QMLE (0 sn ) 






(1 s ri. 


4>lsn 




<*lsn 


/3lsn 




4>lsn 


O-Osn 






Bias 


-0.0003 


-0.0016 


0.0041 


0.0114 


-0.0227 


0.0005 


-0.0039 


0.0031 


0.0104 


-0.0127 


SD 


0.0243 


0.0451 


0.0301 


0.0624 


0.1237 


0.0283 


0.0458 


0.0242 


0.0750 


0.0755 


AD 


0.0240 


0.0443 


0.0285 


0.0607 


0.1184 


0.0283 


0.0461 


0.0243 


0.0704 


0.0741 






Local QMLE (0„) 






Local QMLE (0 n ) 






An 


4>ln 


OLOn 




Pm 


A" 


4>ln 


C*Oti 




/3i„ 


Bias 


0.0007 


-0.0034 


0.0026 


0.0037 


-0.0144 


0.0022 


-0.0045 


0.0020 


0.0044 


-0.0081 


SD 


0.0243 


0.0368 


0.0279 


0.0461 


0.1115 


0.0282 


0.0377 


0.0227 


0.0579 


0.0674 


AD 


0.0236 


0.0361 


0.0261 


0.0459 


0.1026 


0.0281 


0.0384 


0.0230 


0.0564 


0.0659 



Akaike's information criterion (AIC) leads to the following model: 
y t = 0.2876e t _i + 0.1524 £i _ 3 + e u 

(5.1) 

(0.0357) (0.0357) 

where the standard errors are in parentheses, and the estimated value of 
is 16.83. Model (5.1) is stationary, and none of the first ten autocorrelations 
or partial autocorrelations of the residuals {it} are significant at the 5% 
level. However, looking at the autocorrelations of {ef }, it turns out that the 
1st, 2nd and 8th all exceed two asymptotic standard errors; see Figure 4(a). 
Similar results hold for the partial autocorrelations of {ef} in Figure 4(b). 
This shows that {e^} may be highly correlated, and hence there may exist 
ARCH effects. 

Thus, we try to use a MA(3)-GARCH(1, 1) model to fit the data set {yt}- 
To begin with our estimation, we first estimate the tail index of {yf } by 
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Table 2 

Estimators for model (4-1) when rjt ~ N(0, 1) 

9 = (0.0,0.5,0.1,0.18,0.4) O = (0.0,0.5,0.1,0.6,0.4) 

Self- weighted QMELE (0 S „) Self- weighted QMELE (0 S „) 

sn O^Osn OtXsn ftXsn fi>sn 01 sn ^Osn ^lsn Plsn 

Bias 0.0003 -0.0042 0.0075 0.0065 -0.0372 -0.0008 -0.0034 0.0029 -0.0019 -0.0028 

SD 0.0192 0.0457 0.0366 0.0600 0.1738 0.0255 0.0437 0.0204 0.0815 0.0512 

AD 0.0189 0.0443 0.0379 0.0604 0.1812 0.0257 0.0424 0.0202 0.0809 0.0491 

Local QMELE (0„) Local QMELE (0„) 

A" 4>ln &On &\ n /3l„ A" <f>\n «On &ln Pin 

Bias 0.0006 -0.0051 0.0061 0.0019 -0.0268 0.0000 -0.0040 0.0029 -0.0048 -0.0015 

SD 0.0184 0.0372 0.0357 0.0487 0.1674 0.0252 0.0364 0.0197 0.0671 0.0472 

AD 0.0183 0.0370 0.0350 0.0488 0.1652 0.0252 0.0359 0.0194 0.0685 0.0453 

Self-weighted QMLE (8 an ) Self-weighted QMLE (0 sn ) 

fl & ri. 01sn. C^Osra OtXsn /3lsn. fJ-sn 01 sn. ^Osn C^lsn. /3lsrc. 

Bias -0.0001 -0.0039 0.0069 0.0089 -0.0361 -0.0006 -0.0016 0.0024 0.0027 -0.0045 

SD 0.0151 0.0366 0.0333 0.0566 0.1599 0.0196 0.0337 0.0189 0.0770 0.0481 

AD 0.0150 0.0352 0.0345 0.0568 0.1658 0.0200 0.0329 0.0188 0.0757 0.0459 

Local QMLE (0 n ) Local QMLE (0 n ) 

ra C^Ora C^lra fJln 



Bias 0.0009 -0.0048 0.0055 0.0038 -0.0252 0.0004 -0.0031 0.0024 -0.0019 -0.0027 
SD 0.0145 0.0300 0.0322 0.0454 0.1535 0.0195 0.0287 0.0183 0.0633 0.0442 
AD 0.0145 0.0294 0.0320 0.0460 0.1517 0.0197 0.0279 0.0181 0.0644 0.0424 



using Hill's estimator {& y (k)} with k = 1,...,180, based on {y|}J£i- The 
plot of {cij/(^)}l=i is given in Figure 5, from which we can see that the 
tail index of \y\ } is between 1 and 2, that is, Eyf = oo. So, the standard 
QMLE procedure is not suitable. Therefore, we first use the self-weighted 
QMELE to estimate the MA(3)-GARCH(1, 1) model, and then use the one- 
step iteration as in Section 3 to obtain its local QMELE. The fitted model 
is as follows: 

yt = 0.3276 £i _i + 0.1217e t _ 3 + e u 

(0.0454) (0.0449) 

^ 5 ' 2 ^ h t = 0.5147 + 0.04354-1 + 0.8756/t t _i , 

(0.3248) (0.0159) (0.0530) 

where the standard errors are in parentheses. Again model (5.2) is station- 
ary, and none of first ten autocorrelations or partial autocorrelations of the 
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Table 3 

Estimators for model (4-1) when rjt ~ t$ 



O = (0.0,0.5,0.1,0.18,0.4) O = (0.0,0.5,0.1,0.2,0.4) 



Self- weighted QMELE (0 S „) Self- weighted QMELE (0 S „) 





fl>sn 


4>lsn ao«n 


aian 






4>lsn «0»n 




/3lsn 


Bias 


0.0004 


-0.0037 0.0059 


0.0081 


-0.0202 


-0.0005 


-0.0026 0.0032 


0.0088 


-0.0158 


SD 


0.0231 


0.0416 0.0289 


0.0600 


0.1084 


0.0221 


0.0404 0.0252 


0.0619 


0.0968 


AD 


0233 


0.0393 0.0282 


0.0620 


0.1101 


0238 


0.0393 0.0266 


0.0637 


0.1001 






Local QMELE (0„) 






Local QMELE (0„) 






An 


4>ln OLQn 


CLln 




An 


4>ln &On 


OL\-n 




Bias 


0.0011 


-0.0039 0.0041 


0.0011 


-0.0115 


0.0001 


-0.0028 0.0019 


0.0029 


-0.0092 


SD 


0.0229 


0.0328 0.0256 


0.0429 


0.0955 


0.0218 


0.0325 0.0226 


0.0450 


0.0842 


AD 


0.0228 


0.0314 0.0252 


0.0461 


0.0918 


0.0233 


0.0317 0.0243 


0.0483 


0.0851 






Self- weighted QMLE (0 STl ) 






Self-weighted QMLE (0 S „) 






[1 s ri. 


4>lsn «Osn 


aisn 


/3i s « 


fJ-srt 


<plsn CtOsn 






Bias 


-0.0056 


-0.0151 0.0029 


0.0503 


-0.0594 


0.0036 


-0.0141 0.0115 


0.0442 


-0.0543 


SD 


0.9657 


0.1045 0.0868 


0.2521 


0.1740 


0.1827 


0.1065 0.3871 


0.2164 


0.1605 


AD 


0.0536 


0.0907 33.031 


0.1795 


34.498 


0.0517 


0.0876138.38 


0.1875 


11.302 






Local QMLE (0„) 






Local QMLE (0„) 






pn 


4>ln OLOn 


Ctln 




An 


4>ln OLO n 


Ctln 


/3i„ 


Bias 


-0.0048 


-0.0216-2.1342 


0.0185 


3.7712 


-0.0010 


-0.0203 1.3241 


0.0253 


-0.1333 


SD 


0.0517 


0.1080 38.535 


0.3596 


83.704 


0.0521 


0.1318 42.250 


0.2524 


3.4539 


AD 


0.0508 


0.0661 55.717 


0.1447 


45.055 


0.0520 


0.0707 13.761 


0.1535 


1.1343 



residuals fjt — ith t are significant at the 5% level. Moreover, the first 
ten autocorrelations and partial autocorrelations of {fj^} are also within 
two asymptotic standard errors; see Figure 6(a) and (b). All these results 
suggest that model (5.2) is adequate for the data set {yt}- 

Finally, we estimate the tail index of r/| in model (5.2) by using Hill's 
estimator a v (k) with k = 1, . .. , 180, base on {f/f }. The plot of {^(k)}^^ 
is given in Figure 7, from which we can see that Erfi is most likely finite, 
but Er]f is infinite. Furthermore, the estimator of Erft is X^ILi l n = 1-6994, 
and it turns out that &in(Ylt=i Vt/ n ) + Pin = 0.9495. This means that 
Eel < 00 ■ Therefore, all the assumptions of Theorem 3.1 are most likely 
satisfied. In particular, the estimated tail indices of {yf } and {f/^ } show the 
evidence that the self-weighted/local QMELE is necessary in modeling the 
crude oil price. 
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(a) (b) 
Fig. 4. (a) The autocorrelations for {it} and (b) the partial autocorrelations for {it}- 



6. Proofs of Lemmas 2.2 and 2.3. In this section, we give the proofs of 
Lemmas 2.2 and 2.3. In the rest of this paper, we denote C as a universal 
constant, and G{x) be the distribution function of rjt- 

Proof of Lemma 2.2. A direct calculation gives 

,2w t det(7o) 



% 89 



-M t u) 



QMELE FOR ARMA-G ARCH /IG ARCH MODELS 



21 




20 40 60 80 100 120 140 160 180 



Fig. 5. Hill estimators {a y (k)} for {j/f}. 



Lag 

(a) 



Lag 

(b) 



Fig. 6. (a) The autocorrelations for {r?t } and (b) the partial autocorrelations for {i)t}- 



where M t (u) = L X t {— qu(u)s) ds. Thus, we have 



|n ln (u)|<2|M|^ 

It is sufficient to show that 



J 3 t =l 



(6.1) 



w t de t (6 ) 



% 86 



^{M t {u n )-E{M t {u n )\T t -i}} 



J t=i 



Op(Vn + n\\u T 
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Hill estimators 



80 100 120 140 160 180 

k 



Fig. 7. The Hill estimators {a v (k)} for {fjl} of model (5.2). 
— 1/2 

for each 1 < j < m. Let mt = wth t dst(Bo)/d9j, ft(u) = mtMt{u) and 

1 n 

D » = -1= £{/t(«) " E lft(u)\Tt-i}}. 



t=i 



Then, in order to prove (6.1), we only need to show that for any rj > 0, 

/ \ \DJu)\ 
(6.2) 



\\u\\< v 1 + v^lMI 



Note that mt = max{mt,0} — max{— mt,0}. To make it simple, we only 
prove the case when mt > 0. 

We adopt the method in Lemma 4 of Pollard (1985). Let 5 = {ft(u) ■ 
\\u\\ <rf\ be a collection of functions indexed by u. We first verify that 5 
satisfies the bracketing condition in Pollard (1985), page 304. Denote B r {Q) 
be an open neighborhood of £ with radius r > 0. For any fix e > and 
<5 <r/, there is a sequence of small cubes {B eS / Cl (ui)}^f 1 to cover Bg(0), 
where K e is an integer less than coe~ m and cq is a constant not depending 
on e and 5; see Huber (1967), page 227. Here, C\ is a constant to be selected 
later. Moreover, we can choose Ui{5) C B £ $/ Cl (ui) such that {Ui(5)}^f 1 be 
a partition of -6,5(0). For each u £ Ui(S), we define the bracketing functions 
as follows: 

<9et(7o) 



C X t (-qit(u)s±^ 7 = 
Jo V C lv ^ 



#0 



Since the indicator function is nondecreasing and mt > 0, we can see that, 
for any u 6 Ui(5), 

fr(u t )<ft(u)<ft + ( Ui ). 
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2e5 



3.3) E[f+{ Ui ) - / t -(«i)|^t-i] < -yr sup 9 (x)-^ 



^ (70) 



00 



e5A t 
Ci 



Setting C\ = E(A t ), we have 

E[f+( Ui ) - f t -( Ui )) = E{E[f+( Ul ) - /r(«i)|^ t -i]} < e*. 

Thus, the family ^ satisfies the bracketing condition. 

Put 6 k = 2"V Define B(k) = B Sk (Q), and A(k) to be the annulus B{k)/ 
B(k + 1). Fix e > 0, for each 1 < i < K e , by the bracketing condition, there 
exists a partition {^(f^)}^ of B(k). 

We first consider the upper tail. For u £Ui(5 k ), by (6.3) with 6 = 6 k , we 
have 



DJu) < 



1 n 

v n t =l 

1 - 



1 n 



t=i 



where 



1 n 

D+( Ui ) = -= £{/t + («i) - S[/+(^)|^-i]}. 

V t=l 



Denote the event 



E n 



On with u £ Ui(8k), it follows that 

(6.4) D n («) < + 2v^e<5 fe . 

On .A(fc), the divisor 1 + ^/n||u|| > y/n6k+i = y/n6 k /2. Thus, by (6.4) and 
Chebyshev's inequality, it follows that 

p( sup >6e,E n 

\ueA(k) 1 + v n \m\ 



<P[ sup D n (u) > 3^e6 k ,E n 
v ueA(fc) 
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(6.5) 



< P ( max sup D n (u) > 3\/neSk , E n 
\l<i<Ke ue Ui{8 k )nA(k) 



<P( max D+(ui)> ^fne5 k ,E n 

\l<i<K E 

< K £ max P(D+(ui) > VneS k ) 

l<i<K e 



< K e max 

Ki<K e 



Note that \q\t{ui)\ < CS^pt-i and m| < CiUf^p t _ 1 for some p G (0,1) by 
Lemma A.l(i), and s\xp x&R g(x) < oo by Assumption 2.6. By Taylor's ex- 
pansion, we have 

E[(ft(u l )f) = E{E[(ft{u l )f\T t ^)} 



<E 
< CE 



/??; 



A t -9it(ui)s + 



e<5 fc 



06 



t-i 



ds 



sup \G(x)-G(0)\wte pt -i 

\x\<S k C^ p t-i 



< S k CE(w 2 t f pt -i). 

Since f^~{ui) — E[f^{ui)\ J-t-\\ is a martingale difference sequence, by the 
previous inequality, it follows that 

1 n 

E[(D+(!h)) 2 ] = - E[f+{ Ul )\F t ^}} 2 

t=i 

1 n 

<-E^[(/* + W) 2 i 



(6.6) 



t=i 



n t=i 

= K n (8 k ). 
Thus, by (6.5) and (6.6), we have 

D n {u) 



P[ sup 

KU€A(k) l + V n \\ U \ 



>6e,E n <K, 



7Tn(4) 



By a similar argument, we can get the same bound for the lower tail. Thus, 
we can show that 



(6.7) 



P 



\D n (u) 



sup 

VnGA(fc) 1 + \HM 



>6e,E n <2K t 



6 ne2 ^l 
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Since ir n (5k) — > as k — > oo, we can choose k £ so that 

2TT n (5 k )K £ /(er ] ) 2 <e 
for k> k £ . Let k n be an integer so that n" 1 / 2 < 2~ fe ™ < 2n^ 1 / 2 . Split {u : 

|| -u || < r]} into two sets B(k n + 1) and B(A; n + l) c = Ufc=o^( fc )- % ( 6 - 7 )> 
since 7r n (<!>&) is bounded, we have 

V«eB(fc„+i) c 1 + V™HI 
(6.8) <E P ( SU P rZ^h >fc,E n )+P(E c n ) 

fc=o ' fc=fc £ 

n J n 

<o(^\+4e + P(E c n ). 

Since 1 + \Zre||iz|| > 1 an d V™dk n +i < 1) using a similar argument as 
for (6.5) together with (6.6), we have 

p( sup - Dn j-l > 3e, £ n ) < P( max D+( Ui ) > e, E n 



< 



£ 2 



We can get the same bound for the lower tail. Thus, we have 

( \D n (u)\ 
P( sup - 1 nK ' 1 „ >3e 
V«GB(fc n +i) 1 + Vn\\u\\ 

(6.9) =pf sup J^L >3£iEn )+P(E c n ) 

\ueB(k n +i) 1 + V n IMI / 

2-RT £ 7r ra ((5 fcn+ i) 

^ ^2 ^(AJ- 

Note that 7r n ((5/ Cn+ i) — )• as n — > oo. Furthermore, P(E n ) — > 1 by the ergodic 
theorem. Hence, 

P(-E£)-»0 as n->oo. 
Finally, (6.2) follows by (6.8) and (6.9). This completes the proof. □ 
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Proof of Lemma 2.3. (i). By a direct calculation, we have 

-Ht(u) 



r—qitm 

U 2n (u) = 2 YV / G(s) - G(0) ds 

Jo 



t=i 



it 



.10) = 2 J2 Wt saklds 



t=l 



-qu{u) 



= (y/nu)'[K ln + K 2n (u)](y/nu), 
where lies between and s, and 

K _ 9(0) w f get (70) de f (7o) 

K 2n {u) = -—^Y^wt s[g(^)-g(0)]ds. 
n\M\ t=1 Jo 

By the ergodic theorem, it is easy to see that 

(6.11) K ln = £i + o p (l). 

Furthermore, since |q f it(u)| < C||it||^pt-l for some p £ (0, 1) by Lemma A.l(i), 
it is straightforward to see that for any 77 > 0, 



2 ™ r\m(u)\ 
sup \K 2n (u)\< sup — jj— p- w t / s\g(s*) - g(0)\ds 

\\u\\<ri \\u\\<r) n \\ u \\ t=1 J -\qit(u)\ 



1 n 

<-V sup \g(s) -fif(0)|w t ^ t _i 

By Assumptions 2.4 and 2.6, E(wtCpt~i) < 00 an d su Pxgi?5( 2; ) < 00 • Then, 
by the dominated convergence theorem, we have 



lim E 

T)^0 



sup \g(s) -g(0)\wt€%-i 

\ s \<Cri( pt -i 



0. 



Thus, by the stationarity of {yt} and Markov's theorem, for Ve, 5 > 0, 3ryo(e) > 
0, such that 



(6.12) P( sup |tf 2n (u)|><$) < e - 

ll«ll<»?o ^ 

for all n > 1. On the other hand, since u n = o p (l), it follows that 
(6-13) ^(IKH>%)<| 
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as n is large enough. By (6.12) and (6.13), for Ve, 5 > 0, we have 

P(\K 2n (u n )\ >S)< P{\K 2n {Un)\ > \\u n \\ < Vo) + P(\\u n \\ > Vo) 

<P( sup \K 2n (u)\ >S) + £ - 

\\u\\< m J I 

< £ 

as n is large enough, that is, K 2n {u^) = o p (l). Furthermore, combining (6.10) 
and (6.11), we can see that (i) holds. 

(ii) Let n 3n (u) = (y/nu)' K 3n (£*)(Vnu) + K 4n (u), where 

-qt(u) 



K 4n (u) = 2^2w t X t (s)ds 



t=i 



By Assumption 2.4 and Lemma A.l(i), there exists a constant p S (0, 1) such 
that 



sup 





d 2 et{C) 




d6d6' 



< CE(w t Cpt-i) < oo. 



Since rjt has median 0, the conditional expectation property gives 
w t d 2 e t (C 



E 



% 0986 



/[I(r k >0)-I(r h <0)] =0 



Then, by Theorem 3.1 in Ling and McAleer (2003), we have 



SUp|K 3 n(r)l=Op(l)- 



On the other hand, 



K 4n (u) 2 ^ 



n\\u\\ 



t=i 

n 



-92t(u)/||u|| 



A t (||u|| 2 s - q lt (u))ds 



n 



t=i 



By Lemma A.l, we have \\\u\\ 2 q2t{u)\ < C£ p t-i and |git(ti)| < C||ti||£ p t_i 
for some p € (0, 1). Then, for any 7] > 0, we have 

{XtiCrfS^ + CriZpt-!) 

-C£pt-i 

-Xti-Cif^t-i-Crj^t-i^ds 



sup \Ju(u)\ < w t 

\\u\\<ri 
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< 2Cw t t pt - l {X t {Ci 1 2 t pt _ l + Crtpt-i) 

-Xti-Cif^-Cr,^)}. 

By Assumptions 2.4 and 2.6 and the double expectation property, it follows 
that 



E 



sup |J M (u)| <2CE[w t Z pt - 1 {G(Cr ] 2 ^ 1 +Crt pt - 1 ) 

\u\\<ri 

-Gi-Cv^^-C^i)}} 
< C(rj 2 + rj) sup g(x)E(w t e pt -x) -> 

X 

as 77—7-0. Thus, as for (6.12) and (6.13), we can show that K4 n (u n ) 
o p (n\\u n \\ 2 ) . This completes the proof of (ii). 



(hi) Let n 4n (ii) = (y/nu)'[n 1 Yh=i J 2t(C)](V^ u ): where 

J2t(C)=Wt 





e*(7d) 


(1 





i\ 1 dht(C)dht(C) 



4:Jh 2 {(*) 89 



89' 



By Assumption 2.4 and Lemma A.l(ii)— (iv), there exists a constant p £ (0, 1) 
and a neighborhood Gq of #0 such that 



E 



sup I MC)\ < CE[w t e pt -i(\vM P t-i + 1)] < 00. 
C*ee J 



Then, by Theorem 3.1 of Ling and McAleer (2003), we have 



sup 

C*ee 



1 n 



0p (i). 



Moreover, since (* — > 9q a.s., by the dominated convergence theorem, we 
have 

lim E[J 2t (C n )]=E[J 2t (9 )} = ^ 2 . 

n— >oo 

Thus, (hi) follows from the previous two equations. This completes the proof 
of (hi). 

(iv) Since E\r]t\ = 1, a similar argument as for part (hi) shows that (iv) 
holds. 

(v) By Taylor's expansion, we have 

1 1 _ -v! dh t (c) 

Vh(9 + u) v^) 2(MC*)) 3/2 99 ' 
where £* lies between 9q and #0 + u - By identity (2.6), it is easy to see that 

M70 + «i)| - kt(7o)| = n'^4P[I( m > 0) - I( Vt < 0)] 



+ 2vl 



00 



09 



Xi 



u'_ 8e t (C) 
ht 89 



s ) ds, 
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where £* lies between 70 and 70 + u\. By the previous two equations, it 
follows that 

n 

y^WtCtju) = {y/nu)'[K 5n {u) + K 6n (u)](y/nu), 



f=i 



where 



^5n(«) = - 2^ — ^7— — o« -jr" [I(r/i < 0) - /(% > 



n 

t=i *'H 



1^ w t dh t (C)de t (C) f l v ( u> de t (C) \ , 
#6»(«) = ^> -075 57 ^Zr - / X t 7F= ao s ds - 

By Lemma A.l(i), (hi), (iv) and a similar argument as for part (ii), it is easy 
to see that K§ n {u n ) = o p (l) and K§ n {u n ) = o p (l). Thus, it follows that (v) 
holds. This completes all of the proofs. □ 

7. Concluding remarks. In this paper, we first propose a self-weighted 
QMELE for the ARMA-GARCH model. The strong consistency and asymp- 
totic normality of the global self-weighted QMELE are established under 
a fractional moment condition of St with E?]^ < 00. Based on this estimator, 
the local QMELE is showed to be asymptotically normal for the ARMA- 
GARCH (finite variance) and -IGARCH models. The empirical study shows 
that the self- weighted/local QMELE has a better performance than the self- 
weighted/local QMLE when r]t has a heavy-tailed distribution, while the 
local QMELE is more efficient than the self-weighted QMELE for the cases 
with a finite variance and -IGARCH errors. We also give a real example to 
illustrate that our new estimation procedure is necessary. According to our 
limit experience, the estimated tail index of most of data sets lies in [2,4) in 
economics and finance. Thus, the local QMELE may be the most suitable 
in practice if there is a further evidence to show that Er\\ = 00 . 

APPENDIX 
The Lemma A.l below is from Ling (2007). 

Lemma A.l. Let £ p t be defined as in Assumption 2.4- If Assumptions 2.1 
and 2.2 hold, then there exists a constant p G (0, 1) and a neighborhood ©o 
of 6q such that: 

Bup|e t _i(7)| <C£ pt _i, 
e 

... dztd) ^ 7 

(1) sup — < C£pt-i and 

e 
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sup 





d 2 et{l) 



(ii) 
(iii) 



su V h t {6)<Ce pt -i, 
e 



sup 

e 



1 dh t (6) 



(iv ) sup 
e 



h t (9) 85 
1 dht(6) 



<C£ pt -i. 



for any t x G (0, 1), 



Lemma A. 2. For any 9* £ 9, let B v (9*) = {9 £ Q : \\Q - Q*\\ < r]} be 
an open neighborhood of 8* with radius r] > 0. If Assumptions 2.1-2.5 hold, 
then: 

(i) E supwtk(8) <oo, 

L 9ee J 

(ii) E[wth(9)] has a unique minimum at 9q, 

(iii) e\ sup w t \l t {9) - k(9*)\] as rj -> 0. 

Proof. First, by (A. 13) and (A. 14) in Ling (2007) and Assumptions 2.4 
and 2.5, it follows that 



E 



wt\e t {l)\ 
sup — 

.0ee \Jh t {9) 
for some p G (0, 1), and 



<OE[iut£,t-i(l + M)]<oo 



sup^t log yjht{6) 



< oo; 



see Ling (2007), page 864. Thus, (i) holds. 
Next, by a direct calculation, we have 

w t \et{l Q ) + (l-lv)'{de t {e)/d9)\ 



E[w t l t {8)] = E 
= E 
> E 

= E 



w t log y/h t {9) + 



w t log y/h t {8) + 



Et(lo) + (7-7o) 



,«9e 4 (e 



90 



w t \og^Jh t (9) + 



u; t lo£ 



M*) /M#o) 



h t (9 ) I/ ^(0) 



+ £7[iot log Aw 
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where the last inequality holds since r\t has a unique median 0, and obtains 
the minimum if and only if 7 = 70 a.s.; see Ling (2007). Here, £* lies be- 
tween 7 and 70. Considering the function f(x) = logx + a/x when a > 0, it 
reaches the minimum at x = a. Thus, E[wth{9)\ reaches the minimum if and 
only if yjht{6) = \/h t (9o) a.s., and hence 9 = 9q; see Ling (2007). Thus, we 
can claim that E[wtlt(0)] is uniformly minimized at 60, that is, (ii) holds. 

Third, let 9* = (7*' \S*')' G 6. For any 9 G B v (9*), using Taylor's expan- 
sion, we can see that 

[6-9*)' dh t (9**) 



log Vh{9)- log v%(#*) 



2h t {9**) 89 



where 9** lies between 9 and 9*. By Lemma A.l(iii)-(iv) and Assump- 
tion 2.4, for some p G (0, 1), we have 



E 



sup w t |logVW)-logV^(0*)l ^c^K^^o 



as 77 — 7- 0. Similarly, 



sup 



sup w t |e t (7*)| 

6>€B„(0*) 



kt(7)l - kt(7* 



as 77 — » 0, 



as 77 — > 0. 



Then, it follows that (iii) holds. This completes all of the proofs of Lemma A.2. 

□ 



Proof of Theorem 2.1. We use the method in Huber (1967). Let V be 
any open neighborhood of 9q G 0. By Lemma A.2(iii), for any 9* G V c = Q/V 
and e > 0, there exists an r/o > such that 



(A.l) 



E 



.eeB 



no 1 



inf w t l t (9) >E[w t l t (9*)]-£. 



From Lemma A.2(i), by the ergodic theorem, it follows that 



(A.2) 



1 



7? 



V inf w t l t (9)>E\ inf w t l t (9) 



t=i 



eeB V0 (e*) 



as n is large enough. Since V c is compact, we can choose {B Vo (9i) :9. L G 
V c , i = 1, 2, . . . , k} to be a finite covering of V c . Thus, from (A.l) and (A.2), 
we have 

inf L sn {9) = min inf L sn {9) 
eev c i<i<keeB V0 (e t ) 
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1 " 

(A.3) > min -V inf w t UO) 

i<i<k n f^eeB V0 (e,) 

> min E\ inf wthiQ) 
i<i<k ieeB V0 (ei) 

as n is large enough. Note that the infimum on the compact set V c is at- 
tained. For each 6^ £ V c , from Lemma A.2(ii), there exists an eo > such 
that 

(A.4) E\ inf wM8)]>E[w t l t (8 )} + 3e . 

Thus, from (A.3) and (A.4), taking e = Eo, it follows that 
(A.5) infL„,(0) > E[w t l t (9 )} + 2e . 

On the other hand, by the ergodic theorem, it follows that 

1 " 

(A.6) inf L sn {9) < L sn (9 ) -V^fflo) < E[w t l t (9 )} + e . 



Hence, combining (A.5) and (A.6), it gives us 

inf L sn (9) : 
e&v c 

which implies that 



inf L sn (9) > E[w t l t (9 )} + 2e > E[w t lt(9 )} + e > inf L sn (9), 
eev c eev 



9 sn G V a.s. for VV, as n is large enough. 

By the arbitrariness of V, it yields 9 sn — > 9q a.s. This completes the proof. 
□ 
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